HIERARCHAL CLUSTERING AND SIMILARITY MEASURES ALONG WITH MULTI REPRESENTATION
نویسندگان
چکیده
منابع مشابه
Study and Analysis of Multi - viewpoint clustering with similarity measures
Abstract The database object that describes tens of attributes is referred as high dimensional data space. In high dimensional data, the common distance measures can be influenced by noise. Existing clustering algorithms are implemented based on partitioning, hierarchical, density based and grid based. These methods assume some kind of cluster relationship among the clustered objects. Similarit...
متن کاملSimilarity Measures for Multi-valued Attributes for Database Clustering
This paper introduces an approach to cope with the representational inappropriateness of traditional flat file format for data sets from databases, specifically in database clustering. After analyzing the problems of the traditional flat file format to represent related information, a better representation scheme called extended data set that allows attributes of an object to have multi-values ...
متن کاملSimilarity Measures for Writer Clustering
JAYASHREE SUBRAHMONIA IBM T.J. Watson Research, P.O. Box 218 / Route 134, Yorktown Heights, NY 10598, U. S. A. E-mail: [email protected] This paper addresses the problem of improving the performance of an online, writer-independent, large-vocabulary, unconstrained, handwriting recognition system by clustering writers with similar writing styles. Recognition performance is enhanced by identify...
متن کاملSimilarity Measures and Clustering of String Patterns
Clustering is a powerful tool in revealing the intrinsic organization of data. A clustering of structural patterns consists of an unsupervised association of data based on the similarity of their structures and primitives. This chapter addresses the problem of structural clustering, and presents an overview of similarity measures used in this context. The distinction between string matching and...
متن کاملXML schema clustering with semantic and hierarchical similarity measures
With the growing popularity of XML as the data representation language, collections of XML data have exploded in numbers. The methods are required to manage and discover the useful information from them for improved document handling. We present a schema clustering process by organising heterogeneous XML schemas into groups. The methodology considers not only the linguistic and the context of t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Research in Engineering and Technology
سال: 2013
ISSN: 2321-7308,2319-1163
DOI: 10.15623/ijret.2013.0208012